Using Latent Semantic Indexing for Literature Based Discovery
نویسندگان
چکیده
Latent semantic indexing (LSI) is a statistical technique As described by Swanson, there are two basic literature for improving information retrieval effectiveness. Here, discovery processes. The first leads from the literature we use LSI to assist in literature-based discoveries. The (R) associated with an initial topic to the literatures (I) idea behind literature-based discoveries is that different of one or more related, intermediate topics. The second authors have already published certain underlying scienleads from one of these related topics to the literature tific ideas that, when taken together, can be connected to hypothesize a new discovery, and that these connec(PD) associated with a potential discovery. Figure 1 illustions can be made by exploring the scientific literature. trates these two steps (left to right) . We explore latent semantic indexing’s effectiveness on We call these two processes identifying intermediate two discovery processes: uncovering ‘‘nearby’’ relationliteratures and identifying potential discovery literatures, ships that are necessary to initiate the literature based respectively (Fig. 1) . Our interest is learning if latent discovery process; and discovering more distant relationships that may genuinely generate new discovery semantic indexing (Deerwester et al., 1990), a statistical hypotheses. technique used with success in information retrieval, can help with either or both of these processes.
منابع مشابه
Preliminary Experiments on Literature Based Discovery using the Semantic Vectors Package
This paper presents a literature based discovery (LBD) implementation that uses Lucene for indexing, the Semantic Vectors (SV) package for latent semantic analysis, Neo4j for graph database storage, Gephi for visual representation along with custom code written by the author. The approach of using a latent semantic analysis based systems like SV to do LBD is not new, but going the next steps of...
متن کاملApplying Latent Semantic Indexing in Frequent Itemset Mining for Document Relation Discovery
Word-based relations among technical documents are immensely useful information but often hidden in a large amount of scientific publications. This work presents a method to apply latent semantic indexing in frequent itemset mining to discover potential relations among scientific publications. In this work, two weighting schemes, tf and tfidf are investigated with the exploitation of latent sem...
متن کاملتأملاتی بر نمایه سازی تصاویر: یک تصویر ارزشی برابر با هزار واژه
Purpose: This paper presents various image indexing techniques and discusses their advantages and limitations. Methodology: conducting a review of the literature review, it identifies three main image indexing techniques, namely concept-based image indexing, content-based image indexing and folksonomy. It then describes each technique. Findings: Concept-based image indexing is te...
متن کاملReflective Random Indexing and indirect inference: A scalable method for discovery of implicit connections
The discovery of implicit connections between terms that do not occur together in any scientific document underlies the model of literature-based knowledge discovery first proposed by Swanson. Corpus-derived statistical models of semantic distance such as Latent Semantic Analysis (LSA) have been evaluated previously as methods for the discovery of such implicit connections. However, LSA in part...
متن کاملEnhancing Literature Review Methods - towards More Efficient Literature Research with Latent Semantic Indexing
Nowadays, the facilitated access to increasing amounts of information and scientific resources means that more and more effort is required to conduct comprehensive literature reviews. Literature search, as a fundamental, complex, and time-consuming step in every literature research process, is part of many established scientific methods. However, it is still predominantly supported by search te...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- JASIS
دوره 49 شماره
صفحات -
تاریخ انتشار 1998